What To Do about Missing Values in Time-Series Cross-Section Data
Abstract
Applications of modern methods for analyzing data with missing values, based primarily on multiple imputation, have in the last half-decade become common in American politics and political behavior. Scholars in this subset of political science have thus increasingly avoided the biases and inefficiencies caused by ad hoc methods like listwise deletion and best guess imputation. However, researchers in much of comparative politics and international relations, and others with similar data, have been unable to do the same because the best available imputation methods work poorly with the time-series cross-section data structures common in these fields. We attempt to rectify this situation with three related developments. First, we build a multiple imputation model that allows smooth time trends, shifts across cross-sectional units, and correlations over time and space, resulting in far more accurate imputations. Second, we enable analysts to incorporate knowledge from area studies experts via priors on individual missing cell values, rather than on difficult-to-interpret model parameters. Third, because these tasks could not be accomplished within existing imputation algorithms, in that they cannot handle as many variables as needed even in the simpler cross-sectional data for which they were designed, we also develop a new algorithm that substantially expands the range of computationally feasible data types and sizes for which multiple imputation can be used. These developments also make it possible to implement the methods introduced here in freely available open source software that is considerably more reliable than existing algorithms.
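To make the idea of multiple imputation with unit-specific smooth time trends concrete, the following is a minimal Python sketch. It is not the algorithm developed in the paper: it is a generic bootstrap-then-impute illustration for a single variable, and the function name `impute_tscs`, its parameters, and the toy data are all hypothetical. It ignores correlations across variables and across units, and it does not implement cell-level priors.

```python
import numpy as np


def impute_tscs(y, unit, time, m=5, degree=2, seed=0):
    """Return m completed copies of y, where NaN marks a missing cell.

    Hypothetical helper (an assumption for illustration, not the paper's
    method): within each cross-sectional unit, a polynomial time trend is
    fit to a bootstrap resample of the observed cells, and every missing
    cell is filled with the fitted trend plus a normal noise draw.
    """
    rng = np.random.default_rng(seed)
    y = np.asarray(y, dtype=float)
    unit = np.asarray(unit)
    time = np.asarray(time, dtype=float)
    completed = []
    for _ in range(m):
        filled = y.copy()
        for u in np.unique(unit):
            in_unit = unit == u
            obs = np.flatnonzero(in_unit & ~np.isnan(y))
            mis = np.flatnonzero(in_unit & np.isnan(y))
            if obs.size == 0 or mis.size == 0:
                continue
            # Center time within the unit to keep the polynomial fit well conditioned.
            t = time - time[in_unit].mean()
            # Resample observed cells so uncertainty about the trend
            # carries over into the imputed values.
            boot = rng.choice(obs, size=obs.size, replace=True)
            # Cap the degree at what the resampled time points can identify.
            deg = min(degree, np.unique(t[boot]).size - 1)
            coefs = np.polyfit(t[boot], y[boot], deg)
            # Noise scale: root-mean-square residual over the observed cells.
            resid = y[obs] - np.polyval(coefs, t[obs])
            sigma = np.sqrt(np.mean(resid ** 2))
            filled[mis] = np.polyval(coefs, t[mis]) + rng.normal(0.0, sigma, mis.size)
        completed.append(filled)
    return completed


# Toy panel: two units observed over six periods, with three missing cells.
units = np.repeat(["A", "B"], 6)
years = np.tile(np.arange(6), 2)
gdp = np.array([1.0, 1.5, np.nan, 2.6, 3.1, 3.4,
                5.0, np.nan, 4.1, 3.9, np.nan, 3.0])
draws = impute_tscs(gdp, units, years, m=5)
```

Each element of `draws` is one completed dataset. In a multiple imputation workflow, the analysis model would be run on every completed dataset and the results combined, so that the spread across imputations is reflected in the final standard errors.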